Robust speech recognition with spectral subtraction in low SNR

نویسندگان

  • Randy Gomez
  • Akinobu Lee
  • Hiroshi Saruwatari
  • Kiyohiro Shikano
چکیده

Speech recognition in noisy environments is a very difficult task. It is is desirable to search for parameters that would relate the speech enhancement technique directly with the recognizer to optimize the recognition performance. In this paper, Noise Reduction Rate (NRR) and Mel Cepstrum Distortion (MelCD) are investigated when using Spectral Subtraction (SS). Under low SNR such as 0dB,5dB and 10dB, maximizing NRR nor minimizing the MelCD does not result in a better recognition performance. Thus, the conventional SS in which the oversubtraction parameter is a function of SNR renders to be ineffective in the point-of-view of the recognizer. Our proposed method derives for SS directly from the training utterances used in creating the Hidden Markov Models (HMM) that optimizes the recognition performance. By superimposing office noise to the SS-denoised noisy speech, we achieved 26.0% and 7.6% for relative increase in word accuracy for the proposed matched and generalized respectively.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Modified Speech Enhancement Using Adaptive Gain Equalizer with Non linear Spectral Subtraction for Robust Speech Recognition

In this paper we present an enhanced noise reduction method for robust speech recognition using Adaptive Gain Equalizer with Non linear Spectral Subtraction. In Adaptive Gain Equalizer method (AGE), the input signal is divided into a number of subbands that are individually weighed in time domain, in accordance to the short time Signal-to-Noise Ratio (SNR) in each subband estimation at every ti...

متن کامل

Spectral Subtraction Using Spectral Harmonics for Robust Speech Recognition in Car Environments

This paper addresses a novel noise-compensation scheme to solve the mismatch problem between training and testing condition for the automatic speech recognition (ASR) system, specifically in car environment. The conventional spectral subtraction schemes rely on the signal-to-noise ratio (SNR) such that attenuation is imposed on that part of the spectrum that appears to have low SNR, and accentu...

متن کامل

Missing data theory, spectral subtraction and signal-to-noise estimation for robust ASR: an integrated study

In the missing data approach to robust Automatic Speech Recognition (ASR), time-frequency regions which carry reliable speech information are identified. Recognition is then based on these regions alone. In this paper, we address the problem of identifying reliable regions and propose two criteria to solve this based on negative energy ( $ s < 0 ) and SNR ( $ s s n 2 2 2 < + ). These criteria a...

متن کامل

Improving the performance of MFCC for Persian robust speech recognition

The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...

متن کامل

A novel spectral subtraction scheme for robust speech recognition: spectral subtraction using spectral harmonics of speech

The weakness of conventional spectral subtractive-type algorithm is identified and presented in Section 2. The proposed remedial approach is described in Section 3. In Section 4, we show the proposed method’s effectiveness over conventional methods with representative experiments using Aurora 2. Concluding remarks are provided in Section 5. This paper addresses a novel noise-compensation scheme...

متن کامل

Isolated Word Recognition Based on Combination of Multiple Noise-Robust Techniques

Although many noise-robust techniques have been presented, the improvement under low SNR condition is still insufficient. The purpose of this paper is to achieve the high recognition accuracy under low SNR condition with low calculation costs. Therefore, this paper proposes a novel noise-robust speech recognition system that makes full use of spectral subtraction (SS), mean variance normalizati...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004